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Abstract 

We extend a newly developed formal system for the description of astrophysical 
maps. In this formalism, we consider the difference between maps to be the distance 
between elements of a pseudometric space (the space of all such maps). This ansatz 
allows us to measure quantitatively the difference between any two maps and to order 
the space of all maps. For each physical characteristic of interest, this technique assigns 
an "output" function to each map; the difference between the maps is then determined 
from the difference between their corresponding output functions. In this present study, 
we show that the results of this procedure are invariant under a class of transformations 
of the maps and the domains of the maps. In addition, we study the propagation of errors 
(observational uncertainties) through this formalism. We show that the uncertainties in 
the output functions can be controlled provided that the signal to noise ratios in the 
original astrophysical maps are sufficiently high. The results of this paper thus increase 
the effectiveness of this formal system for the description, classification, and analysis of 
astrophysical maps. 



Subject headings: methods: analytical - methods: data analysis 
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1. INTRODUCTION 



Although many different types of astrophysical data are found in maps, the science 
of form description for these types of data structures remains poorly developed (see, e.g., 
Lord k Wilson 1984). In a previous paper (Adams 1992; hereafter Paper I), we presented 
a mathematical formalism for the analysis and classification of astrophysical maps. This 
formalism considers the collection of all maps of a given type as a metric space. By 
defining distance functions (pseudometrics) on the space of maps, we can measure the 
difference between any two maps in a quantitative manner. We can also provide an 
ordering scheme for the space of maps by measuring the "distance" from each map to a 
well-defined reference map; we can then order the elements of the space (the maps) by the 
size of these "distances" . In Paper I, we presented the formalism and proved basic results. 
However, before this formal system becomes a fully functional astrophysical technique, 
we must show that it can adequately deal with the standard difficulties associated with 
astronomical data, e.g., observational uncertainties, calibration errors, varying distances 
to sources, etc. The overall goal of this paper is to prove a series of results which show how 
this formalism behaves in the presence of the aforementioned astronomical difficulties. 
Our hope is to place this formal system on a solid theoretical footing which will allow for 
its effective use in the analysis of astrophysical maps. In a companion paper (Wiseman 
k Adams 1994; hereafter WA), we illustrate the efficacy of this approach by applying it 
to a collection of continuum maps of molecular clouds; in particular, we show that this 
formal system provides an effective means of describing and classifying molecular clouds. 

Previous studies involving form description in astrophysics have often been concerned 
with the large scale structure of the universe (see, e.g., the reviews of Shandarin k 
Zel'dovich 1989; Melott 1990). For example, Gott, Melott, k Dickinson (1986) have 
studied large scale structure by calculating the topological genus of the surfaces separating 
high density regions from those of low density. Elizalde (1987) has presented a metric 
which measures the distributions of voids and clusters for point distributions (such as 
distributions of galaxies in the universe). Another topic that has been explored is the 
possibility that fractal structure appears in observed maps of molecular clouds (e.g., 
Bazell k Desert 1988; Dickman, Horvath, k Margulis 1990; Falgarone, Phillips, k Walker 
1991). Clumps and clump mass spectra in molecular clouds have been studied over 
the last decade (e.g., Larson 1981, 1985; Williams, de Geus, k Blitz 1994). Recently, 
more detailed studies of substructure within molecular clouds have begun (Myers 1991; 
Houlahan k Scalo 1992; Scalo 1990; Veeraraghavan k Fuller 1991; Wood, Myers, k 
Daugherty 1994; see also our companion paper). However, as emphasized by Elmegreen 
(1993) in his recent review, a well defined mathematical framework with which to describe 
molecular clouds (and other astrophysical maps) is badly needed. 

The fundamental concept involved in the formal system of Paper I is that we can 
consider astrophysical maps as elements of a metric space; the difference between any 
two maps is then the "distance" between two elements of a metric space (the space of all 
such maps). We measure this "distance" by constructing a distance function (a metric), 
written as 

d(<JA, <tb), (1-1) 
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where a a, <Jb are maps. As discussed in Paper I, these distance functions are actually 
pseudometrics and they make the space of all maps into a pseudometric space. The 
basic difficulty in this procedure is that the maps are often very complicated structures 
and hence the interpretation of any distance function becomes complicated. We thus 
want to simplify our interpretation of distance. We therefore "simplify" each map a 
by "measuring" some physical characteristic of the map. For example, we can easily 
determine what fraction of the map has values above a certain threshold level E; we 
thus obtain this fraction as a function of the threshold level. If we interpret the maps 
as a measure of density (or column density), we obtain a function which represents the 
distribution of density in the map. We denote this profile for a given map as m(a; E). We 
then measure the difference between any two maps by measuring the distance between 
their profiles of density, i.e., we find 

d[m{a A ), m(a B )} ■ (1.2) 

The distance as given by equation (1.2) now has a straightforward physical interpretation: 
the distance measures the difference between the distributions of density of the two maps. 
In addition to the distribution of density, we can construct other profiles (as a function 
of threshold level) which measure other properties of the maps, e.g., the number of parts 
the maps break up into, the shape of these parts, the self-gravity of the maps, etc. These 
profiles (usually one dimensional functions of the threshold level) are denoted as output 
functions. The essential feature of this formalism is that all of the above concepts can be 
made mathematically precise. We can thus obtain quantitatively meaningful results. 

In this paper, we extend our formal system for the study of astrophysical maps in two 
different ways: (1) We prove a series of results which show how the output functions (and 
hence the distance functions) vary when we make various transformations of the original 
maps. These transformations include scalings of the map values, rotations, translations, 
stretching the maps, and scalings of the map sizes. We also briefly discuss resolution and 
projection effects. (2) We study the propagation of observational uncertainties through 
this formalism. In particular, we show how observational uncertainties in the original 
maps produce corresponding uncertainties in the output functions. We find that the 
errors can be controlled provided that the signal to noise ratio of the original map is 
sufficiently large. 

This paper is organized as follows. In §2, we review the formal system for measuring 
the distance between astrophysical maps and ordering the space of all maps. We then 
extend this formalism in §3, where we prove several new results which quantify the manner 
in which transformations of the maps affect the output functions. In §4, we discuss the 
effects of random errors in the maps and other observational effects. We conclude in §5 
with a discussion and summary of our results. The mathematical details of the effects 
of transformations and observational uncertainties on the formalism are presented in the 
appendices. 
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2. REVIEW OF THE FORMALISM 



In this section, we review the formalism of Paper I. As discussed above, we consider 
each astrophysical map to be an element of an abstract space - the space of all possible 
maps (Paper I; see also Elizalde 1987). We thus consider the difference between two 
maps to be the "distance" between elements of this space. The space of all maps is thus 
a metric space (actually a pseudometric space) and our goal is to define distance functions 
(pseudometrics) on the space, t 

The procedure we use to define distance functions on the space of all maps con- 
tains two steps. For a given physical characteristic of interest, we first determine a 
one-dimensional function (denoted here as an output function) to each map; this output 
function represents a profile of some physically meaningful quantity (e.g., distribution of 
density - see below). For the second step of this procedure, we determine the difference 
between any two maps by finding the difference between their corresponding output func- 
tions. This difference, in turn, is measured using a standard distance function (denoted 
here as d) defined on the space of functions (for further discussion of standard distance 
functions and metric spaces, see, e.g., Copson 1968; Kelley 1955; Edgar 1990). 

This formal system also allows for the ordering of a collection of maps. This ordering 
is accomplished by assigning a positive real number - a coordinate - to each map, where 
the coordinate is defined to be the distance between the map and a well-defined reference 
state (or set of states). 

This entire procedure can be depicted schematically as: 



Y v 



d 



Y x ,d 



X = jcr | a is a map j 

do X 
(x,do X ) 

dox\a 

{coordinates} C R + 

In the above diagram, we have used the symbol x to represent the assignment of an 
output function to a given map. The space Y x is the space of output functions (one 
output function for each astrophysical map in the space X of all maps). We make the 
space of output functions into a metric space (Y x , d) by using a standard metric d (as we 
discuss below, we generally take d to be the usual L 2 norm). However, we can assign a 



t The difference between a metric space and a pseudometric space is that the latter 
can have two distinct elements with zero distance between them. A distance function that 
allows two distinct elements to be separated by zero distance is known as a pseudometric. 
In the study of astrophysical maps, we want to allow two different maps to be "the same" 
and hence we require a pseudometric space (see Paper I for further discussion) . 
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distance function d o x (where o denotes composition) directly on the original space of 
maps to produce a pseudometric space (X, d o %). The two spaces are related through a 
distance preserving function I known as an isometry. Finally, we assign coordinates to 
the maps through the operation denoted as d o x\ a , which measures the distance from 
the map to the nearest reference map o"o (see §2.4). Further details of this procedure 
are discussed in Paper I. In the following subsections, we describe some of the output 
functions that can be used for the description and study of astrophysical maps. 

2. 1 Distributions of Density and Volume 

As mentioned above, one approach to characterizing a map to determine how much 
of the material is at the highest densities [We note that the map need not be a density 
map - we use this interpretation only as a conceptual guide. In most astrophysical 
applications, the maps will be of either column density or intensity.] 

We can define the fraction of the material at high densities in two different ways. 
We first determine the fraction m of the mass in the map at densities higher than a given 
reference E: 

fd n x a(x) 6[<t(x) -E] 



m(cx; E) = 



(2.1) 



/ d n x a(x) 

where O is a step function and where the integrals are taken over the (bounded) domain 
D of the map. Notice that, for a given map cr, m is a function of one variable (namely 
E). We can also define an analogous function v(a; E) which measures the fraction of the 
volume (area in a 2-dimensional map) greater than the reference density E: 

JcPx 6[cr(x) — E] 



v(<t;E) 



(2.2) 



Given these definitions, we can define a distance between two maps by measuring the 
difference between their corresponding output functions (using either m or v), i.e., we 
define a pseudometrics d m and d v through 



d m (a a,&b) = 



d v ((JA, <Jb) 



1 f°° 

J dT, \m(a A ; E) - m(a B ; E)|' 



dT, \v(a A ; E) - v(a B ; E)| 



1/2 



1/2 



(2.3) 



(2.4) 



Notice that in all cases, both m = and v = at sufficiently large threshold densities 
E; the integrals in the above equations are thus always convergent. Notice also that we 
have made the metrics non-dimensional by normalizing the integrals with an appropriate 
reference density (E). 

The output functions m(cr; E) and v(a; E) have another useful interpretation. Let 
us define V m to be (minus) the derivative of the function m with respect to the variable 
E, i.e., 

dm J d n x a(x)<5[a(x) — E] 



Pm(cr;E) 



dE 



/ d n x <x(x) 



(2.5) 
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where we have used the fact that the derivative of a step function 6 is a delta function 
5(x). Similarly, we define V v via 

n(c7 ' E) = ~dE = Ja^ • (2 - 6) 

The quantity V v is the probability (per unit column density) of a point in the map a 
having the column density E. Similarly, the quantity V m is the probability (weighted by 
the mass) of a point in the map a having the column density E. It is straightforward 
to show that these probability functions are properly normalized, i.e., J V v dT, = 1 and 
/ VmdT, = 1. The interpretation of the derivatives of m and v as probability distributions 
greatly facilitates our understanding of how these output functions behave under various 
transformations (see below). 

2.2 Distribution of Components 

We now consider a diagnostic which can discriminate between different geometrical 
distributions of the high density material. For example, a given map could be composed 
of N separate "units" , each with mass M and a given distribution of density; alternately, 
the map could consist of a single unit (with mass NM) with the same distribution of 
density. Both maps would have the same profile m(E) as defined above, but would be 
quite different. One way to discriminate between these two cases is to count the number 
of topological components as a function of threshold density E. To be precise, we first 
define a reduced space: 

X+ = jx G D a(x) > E|. (2.7) 

For a given threshold density, the space (X£,cIe) has a well defined number n(a; E) of 
topological components (where oIe is the usual Euclidean metric). We can then define a 
pseudometric d n on the space X of all maps through 



d n {o-A,o- B ) = 



1 f°° 

.=r gE |n(0- A ;E) -n(cr B ;E) 
\^) Jo 



1/2 



(2.8) 



Since cr(x) is always a bounded function x£ = for sufficiently large E; thus, n = at 
sufficiently large values of E and the integral is convergent. 

Notice that this pseudometric provides us with topological information on two dif- 
ferent levels. For a given threshold density E, the function n(a; E) measures a topological 
property (the number of components) of the reduced space which is in turn derived 
from the original map a. When combined with our "standard" function space metric d, 
the component function n defines a pseudometric (namely d n = d o n) on the space X of 
all maps. 

2.3 Distribution of Filaments 

We also require some description which measures the shapes of the pieces of the 
map. Given the breakup of a map into components (as described above), we can obtain 
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a measure of the degree to which the components are filamentary (i.e., stringlike). We 
begin with the usual definition of the diameter V of a set A, i.e., 



P(A) = max||x-y| x,yeAj. (2.9) 

For a given threshold density, an astrophysical map breaks up into components as de- 
scribed in the previous section; each of these components has a well defined diameter. 
We can also calculate the area A of a given component. Notice that for a perfectly round 
(circular) component, the area and the diameter are related by the obvious relation A 
= nV 2 /4. In order to obtain a measure of the departure of a given component from a 
circular shape, we first define a factor Tj , which is simply the inverse of the filling factor 
for a given component, i.e., 

We denote the quantity Tj as the "filament index" of the jth component. We also define 
an average factor /: 

'<" e > = S(£e)2>*'* <2 ' n) 

where the sum is taken over all of the components and where / is explicitly written as a 
function of threshold density E. The quantities Wj are weighting values; we consider both 
an unweighted version of the filament index (wj = 1) and a weighted version in which 
each Tj is weighted by the fraction of material in that component (i.e., Wj = Aj/(A), 
where (A) is the average area of the components at the given threshold level). A highly 
filamentary map will thus have a very large value of /. The pseudometric df on the space 
of maps then can be written 



df(o~A, &b) 



dZ\f(<j A ;X)-f(* B ;Z)\ 2 



1/2 



(2.12) 



where / can be either the weighted or unweighted version of the filament function [2.11]. 

2.4 Assigning Coordinates 

In applications, we often require a method of putting astrophysical maps into some 
kind of well defined order. To order the space of all maps using this formal system, we 
assign "coordinates" to the maps by measuring the distance from a given map a to a 
well-defined reference map cq. The resulting coordinates are thus positive real numbers 
and the space of maps can be ordered by the size of these numbers. Notice that, in 
general, there will be different coordinates (and hence a different ordering) for each type 
of output function considered. 

As described in Paper I, we use uniform density maps (00 = constant) as reference 
maps. Notice that there are an infinite number of such reference maps - one for each 
possible value of the constant. We follow Paper I in defining the coordinate to be the 
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distance to the nearest uniform density map. Specifically, for a given pseudometric d x , 
we define the coordinate rj x by 

Co is a uniform density map j . (2.13) 

See Paper I for further details on implementing this minimization procedure. We can 
thus say that map a a is "greater than" map Ob provided that 

VxM > VxM ■ (2-14) 



n x = min|rf x (cr,cr ) 



In addition to the coordinates obtained from the output functions described above, 
we can define other coordinates. For example, in applications (see WA) we sometimes 
want to consider the "self-gravity" of a map. Thus, for a given map we define the quantity 

,^iAc/d»x/<ry^M, (2.15) 

where x and y are (n— dimensional) position vectors and where M w is a normalization 
factor which is chosen to make the coordinate a dimensionless quantity (see Paper I). 
The integrals are carried out over the entire map. The quantity r] w thus plays the role 
of an additional coordinate. If the map traces a mass distribution, then this coordinate 
provides a measure of the self-gravity of the distribution. 



3. BEHAVIOR OF THE FORMALISM UNDER TRANSFORMATIONS 

In this section, we present formal results which show how the formalism described 
in the preceding section behaves under various transformations of the maps. In particu- 
lar, we discuss how the output functions transform under scaling transformations which 
change the values of the maps, but not the spatial positions of the map pixels in the sky. 
Next, we study the effects of changing the domains of the maps for a general class of 
affine transformations (e.g., rotations, translations, stretching of the maps). 

3.1 Scaling Transformations 

We now discuss what happens to the output functions under various types of scaling 
transformations. For the sake of definiteness, we begin with the simplest type of scaling 
transformation, i.e., 

a -> 0<T, (3.1) 

where f3 is a constant. We will also consider more general types of rescalings of the maps. 

Scaling transformations (e.g., equation [3.1]) can arise in many ways. For example, 
overall calibration considerations can necessitate an overall scaling transformation of 
this type. More complicated scaling transformations arise in practice due to calibration 
problems/uncertainties (see the discussion of WA and Theorem 2 below). 
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Another example of scaling occurs due to the two dimensional nature of many as- 
trophysical maps (such as the IRAS maps considered in our companion paper). These 
maps only measure the column density cr(x 2 ), whereas we would like to know the true 
volume density p(^s), where we have explicitly used subscripts to refer to the number of 
spatial dimensions. We are thus limited to considering only the average of the volume 
density over the line of sight, t We can conceptually break up the observed map a into 
two parts through the ansatz 

a(x 2 ) = (p)(x 2 )L(x 2 ), (3.2) 

where £(x 2 ) is the depth of the object (along the line of sight) at position x 2 in the plane 
of the sky. In many applications, the interpretation of maps is driven by the assumption 
that the observed maps trace the mean density (p) and the function £(x 2 ) is slowly 
varying compared to (p). As we show in WA, this assumption can be justified by the 
analysis of real data (at least for some cases). In the sample of WA, the observed maps 
vary by several orders of magnitude in a. On the other hand, the filament index (which 
measures the shapes in the plane of the sky - see §2.3) is rarely greater than 3 and is 
typically ~ 2. Since we do not expect that the variation in map size along the line of 
sight is markedly different from the observed variations in the plane of the sky, we expect 
that the depth function £(x 2 ) varies only by factors of 2 or 3 across the face of a typical 
map. We can thus regard L(x 2 ) as a slowly varying function. We can now consider the 
question: What happens when we want to compare two maps which are essentially the 
same in all regards except that one map is twice as large as the other? If we can take 
the extreme limit where £(x 2 ) = L = constant for each map, then we are comparing two 
maps related by a scaling transformation of the form [3.1] where (3 = 1/L. 

Thus, it is useful to know how the output functions behave under scaling transfor- 
mations. For simple transformations of the form of equation [3.1], the answer to this 
question can be given as follows: 

Theorem 1. Let a be any map and let xi ", ^0 be any of the following output functions: 
distribution of density m(o~; E), volumev(o~; E), components n(o~; E), or filaments f(o~; E). 
Then the output function scales under the transformation a — > (3a according to xi ", ^) 

-x(*;E/0). 

A proof is given in Appendix A. Theorem 1 provides us with a very useful property 
of this formal system. For example, if we want to consider an alternate calibration of 
the original astrophysical maps, we can simply transform the output functions after the 
fact and need not recalculate them. As discussed above, we can also use Theorem 1 to 
compare maps of different physical sizes provided that we know the appropriate scaling 
factor (3 from independent observations. 

t We note that the column density is a measure of the average density only when the 
emissivity of the emitting medium is sufficiently well behaved. In the case of the IRAS 
maps considered in our companion paper (WA) , for example, the presence of temperature 
gradients in the maps can lead to interpretation problems - see Langer et al. (1989). 
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Given the dependence of the output functions on the scaling transformation [3.1], 
we can determine the dependence of the coordinates on scaling. This result can be stated 
as the following corollary, which follows directly from the definition of coordinates and 
Theorem 1: 

Corollary. Let a be any map and let x(a; E) be any of the following output functions: 
m(cr; E), v(a; E), n(a; E), or f(cr; E). Then the coordinate r\ x scales under the transfor- 
mation equation [3.1] according to rj x ((3a) = \f]3 r} x (a) . 

We now consider a more general type of rescaling law in which the maps are trans- 
formed according to 

a -> F(a)a, (3.3) 

where F is a given function of a. In order for this transformation to be "well behaved", 
we must make the further restriction that the quantity F(a)a is a monotonic function 
of a. Notice that we have written the transformation in the form F(a)a rather than 
simply G(a) for convenience; in the transformations of this type encountered for IRAS 
maps (see WA; Jarrett, Dickman, & Herbst 1989), F = 1 for small values of a and only 
departs significantly from unity at large map values. 

We now discuss how the output functions behave under this transformation. Sup- 
pose, for example, that we have already computed output functions for a sample of maps. 
We may want to consider an alternate calibration procedure (which will result is a trans- 
formation of the maps according to equation [3.3]). Since the calculation of the output 
functions is computationally expensive, we would like to be able to simply transform the 
output functions after the fact. 

We first note that, for a given threshold level E, the equation 

F(a)a = E (3.4) 

has (at most) one root because of our assumption that the left hand side is monotonic 
in a. We denote this root by a = E, which is simply a rescaling of the threshold level 
under the transformation. Given this definition, we can determine the manner in which 
the output functions transform under equation [3.3]. We state this result as the following 
theorem: 

Theorem 2. Under transformations of the form given in equation [3.3], the output 
functions transform according to x(a; E) — > x(cr; E), where x represents any of the four 
output functions m(cr;E), f(cr;E), n(a;E), or/(a;E), and where E is the root of equation 
M. 

A proof is given in Appendix B . Theorem 2 shows that even under complicated rescalings 
of the maps (such rescalings can easily arise in practice due to calibration problems), the 
output functions transform in a simple manner. 

3.2 Transformations of the Domain 

In this section, we study the dependence of the output functions under transfor- 
mations which change the domain of the maps. We concentrate on a general class of 
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transformations known as affine transformations and briefly consider the effects of reso- 
lution transformations. 

We begin with the general definition of affine transformations: 

Definition. An affine transformation W in two dimensions is a transformation of the 
form 



W 











X 




X 




y 




1 

y 





an ai2 

«2 1 ^2 2 



+ 



h 
b 2 



(3.5) 



We denote the matrix in the above definition as A and we assume that this matrix 
is non-singular, i.e., det A ^ 0. We are also interested in a particular type of affine 
transformation known as a similitude. 

Definition. A similitude is a special type of affine transformation in which the matrix 
A takes the form 



r cos 6 — r sin 9 
r sin 6> r cos 9 



or 



r cos 6 r sin 6> 
r sin 6> — r cos 6 



(3.6) 



Thus, a similitude corresponds to a rotation (through angle 6>), a scaling of the domain (by 
the factor r), a translation (by [61,62]), and a possible reflection. Affine transformations 
are more general and include the possibility of stretching the x direction and the y 
direction by different amounts (see, e.g., Barnsley 1988 for further discussion of affine 
transformations) . 

For astrophysical maps a, we think of affine transformations as changing the domain 
D of the map. For example, we generally do not know precisely the distance to a given 
object. If we want to conceptually "change the distance" to an object, the corresponding 
change in the map is a similitude (where r is the ratio of the new distance to the old 
distance, 9 = 0, b\ = = 62). In general, an affine transformation acts on an astrophysical 
map according to 



x 



z =a{x,y) 1 



W 



x 



V 

a(x,y) 



(3.7) 



Given these definitions we can state the main result of this section: 



Theorem 3. The output functions m(cr;E), v(a; £), and n(cr; E) are invariant under 
general affine transformations, whereas the distribution of filaments /(cr; E) is invariant 
under similitudes. 

Although the behavior of the output functions under affine transformations is relatively 
easy to understand, it is somewhat more difficult to actually prove what happens. In any 
case, the proof is presented in Appendix C. Theorem 3 shows that the output functions 
transform in a fairly simple manner under trasformations which are often encountered 
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in astrophysics; this result thus provides another justification for using the formalism of 
Paper I. 

3.3 Resolution Transformations 

In this section, we briefly discuss the effects of resolution on the maps. We begin by 
defining a resolution transformation TZ according to 

H[<r(x)] =Af J dV exp[-(x / - x) 2 /LV(x') , (3.8) 

where M is a normalization factor for the transformation. If we normalize the transfor- 
mation such that TZ(1) = 1, we obtain the normalization factor M = 1/nL 2 . Notice that 
we have explicitly written the integral in two spatial dimensions, since the transformation 
occurs in practice due to telescope beams averaging signals within the plane of the sky. 
Here, L is the physical length scale corresponding to the given beam size of the telescope. 
Let us also introduce a constant physical length scale R, which corresponds to the length 
scale on which the true map a varies. We then define 

A = 2R 2 /L 2 . (3.9) 

It is easy to see that in the limit A 1, the integral in equation [3.8] averages out much 
of the structure in the map and we are left with the age-old astronomical problem of loss 
of information. In the opposite limit, A ^> 1, we can perform an asymptotic expansion 
(Appendix D) of the transformation in order to quantify this loss of information. In this 
limit, the leading order change in the map due to a resolution transformation is given by 

Tl[a} = a + ^A 2 a + 0{\~ 2 ) , (3.10) 

where A2 is the two-dimensional Laplacian operator in dimensionless units (see equation 
[D5]). Thus, we see explicitly the expected result that, under a resolution transformation, 
the peaks of the map become less pronounced (A20" < 0) whereas the low points (valleys) 
of the map become filled in (A 2 cr > 0). Thus, the resolution transformation [3.8] leads 
to a loss of dynamic range in the map and the size of this effect is ~ (A 2 cr)/X. 



4. PROPAGATION OF ERRORS 

An important issue which must be addressed is the effect of observational errors on 
the classification scheme outlined above. In particular, we must determine the effects of 
observational errors on the output functions and on the corresponding coordinates. For 
this present discussion we assume that the observed value <7 b s of the map at a particular 
spatial point and the uncertainty Aa in the map at that point can be written 

£>obs = o-r + Act, (4.1) 

where or is the true value of the map at that point. The uncertainty Act is usually 
considered to have a Gaussian distribution. 
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The goal of this section is to show that the output functions used here to describe 
astrophysical maps are sufficiently "well behaved" in the presence of uncertainties in the 
maps. In particular, we want to show that small uncertainties Act in the original maps 
do not lead to large variations in the resulting output functions. Fortunately, by working 
within the formal system of Paper I, we can provide a mathematically precise answer to 
this question (see Appendices E, F, and G). As we discuss below, the net result of this 
analysis is that observational uncertainties can be well controlled within this formalism 
provided that the signal to noise ratios in the original maps are sufficiently high. 

For the distribution of density m(cr; E), it is fairly straightforward to determine the 
relative error Am/m in the output function in terms of the uncertainty in the map; we 
find 



Am 1 



m m 



dm 



ciE 



Aa, (4.2) 

2-i 



where we have used the results of Appendix E. The errors are thus well controlled: small 
uncertainties Act in the maps lead to small relative uncertainties Am/m in the output 
functions. In principle, the slope dm/dT, can be quite large at a particular value of E 
and the corresponding error could be large; in practice, however, the right hand side 
of equation [4.1] does not become overly large (see WA). Notice that we can average 
equation [4.2] over all threshold densities E and thus obtain the average relative error in 
the output function; we find 

Am \ _ log(SNR) 

m J SNR ' 1 } 

where SNR denotes the signal to noise ratio (which we have assumed to be uniform over 
the map). We thus conclude that the distribution of density is sufficiently well behaved 
provided that the signal to noise ratio in the map is sufficiently large. The distribution 
of volume v(a; E) can be treated similarly. 

We now consider possible errors in the distribution of components n(cx;E). This 
present discussion is complicated by the fact that relatively small errors in the map can, 
in principle, produce rather large errors in the number of components. For example, 
the distribution of components n(cr; E) can have extra components due to erroneous 
pixels sticking up above the threshold level. Notice, however, that the opposite does not 
generally occur - we seldom get extraneous components due to a pixel being too low. A 
pixel with an erroneously low value will show up as a "hole" in a component but will not, 
in general, change the number of components. 

We must estimate the probability Pi that a pixel in the map will be erroneously 
larger than a given threshold level. This estimate is given in Appendix F and shows 
that the probability Pi is bounded to be quite small; very roughly we obtain Pi in the 
range 10 -2 to 10 -3 . Thus, in a map containing N x N pixels, the number of pixels 
erroneously appearing larger than the threshold E is bounded to be less than N 2 Pi. For 
the representative values N = 400 and Pi = 10~ 2 (these values are appropriate for the 
maps considered in WA), we have iV 2 Pi = 1600. Although most of these erroneous pixels 
are not expected to produce erroneous components, this number is uncomfortably large. 
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In order to control the number of erroneous components, we should eliminate all single- 
pixel components from the function n(a; E). The probability that two adjacent pixels are 
both erroneously above a threshold is proportional to and the number of (possible) 
erroneous components is bounded by 2N 2 P 2 ~ 32 (for the same values of iV and Pi as 
before). Although the number of spurious components with two pixels is bounded to be 
fairly small, we should adopt the conservative approach and also eliminate all two-pixel 
components from consideration. The smallest components that are kept in the function 
n(cr; E) are those with three pixels and the probability of obtaining a spurious component 
is proportional to Pf ~ 10~ 6 and the number of spurious components is expected to be 
less than ~ 6N 2 Pf ~ 0.96 < 1. We therefore conclude that the errors in the distribution 
of components can be well controlled, provided that we eliminate from consideration 
all components with only one or two pixels. We note that in practice the values of N 
and Pi may differ from those taken here for purposes of illustration, although we have 
been reasonably conservative here. The exercise outlined above should be performed for 
each application of the formalism and, in some cases, components with three (or even 
more) pixels might need to be removed in order to control errors in the distribution of 
components function n(a; E). 

Another issue which must be mentioned is the choice of the step size of the threshold 
level E used in computing the distribution of components output function. This step size 
should be related to the noise levels in the maps used. If the step size is much smaller 
than the RMS noise of the map, then extraneous noise can appear in the output function 
itself. On the other hand, if the step size is much larger than the RMS noise level, then 
information will be lost and we will lose some ability to detect structure. 

For the distribution of filaments f(a; E), the relative error is approximately given by 



where fs is the average filament index of the spurious components and fu is the average 
filament index of the real components (see Appendix G). Here, An/tiR is the relative 
error in the distribution of components function. The quantity in brackets in equation 
[4.4] is expected to be of order unity or smaller (if fs ~ then the errors nearly 
cancel). The quantity fs represents the average shape (filament index) of the spurious 
components. Since these components are made up of pixels which are erroneously on the 
wrong side of the threshold level, and since the probability of erroneous pixels occurring is 
small, these components are generally made up of a small number of pixels. The possible 
shapes of such components can be computed and the average filement index for such 
shapes is not far from unity. Thus, the relative error in the distribution of filaments is 
well controlled provided that An/n^ is small, i.e., provided that the number of spurious 
components is small compared to the number of true components. As we have argued 
above, however, the number of spurious components can be controlled by considering 
only those components with three or more pixels; thus, An can be made small and hence 
Af can also be controlled. 

Although we have shown that uncertainties Act in the maps do not produce unac- 
ceptable large uncertainties in the output functions themselves, we still must show that 




(4.4) 
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the corresponding uncertainties in the coordinates are sufficiently well behaved. Fortu- 
nately, Theorem 3 of Paper I shows that when two maps a a and a b are close together 
in the space of maps, 

d x (a A ,a B ) < e, (4.5) 

then the corresponding coordinates (tja and rjs) are close together in the space of real 
numbers, i.e., 

\r]A - Vb\ < e. (4.6) 

This result follows directly from the fact that we have built this formalism using pseudo- 
metrics (which are uniformly continuous when considered as functions of the maps - see 
Paper I; Copson 1968). Here, we let a a = Oobs denote the observed map, and we let o~b 
= an denote the true map, i.e., the map we would have if there were no observational 
uncertainties (see equation [4.1]). For any given type of output function x, the results of 
the previous analysis show that the difference between the output function Xobs 

of the 

observed map and the output function xr of the true map is small, i.e., 



Xobs = XR + Ax , 



(4.7) 



where Ax is small compared to xr- Using the definitions of the pseudo metrics (see 
equations [2.3], [2.4], [2.8], and [2.12]), we find that the difference between the true map 
and the observed map is given by 



d x i.O-R, Cobs) = 



1 



1/2 



0(|A X |) 



(4.8) 



Using equation [4.8] and Theorem 3 of Paper I, we find that the coordinates of the true 
and the observed maps are also "close together" in the sense that 



\vR-Vob s \ < 0(\A X \) 



(4.9) 



We thus conclude that the uncertainties in the coordinates are sufficiently well behaved 
in this formalism. 



5. SUMMARY AND DISCUSSION 

In this paper, we have extended the formalism for using metric space techniques 
to study astrophysical maps. In particular, we have proved a number of useful results 
concerning this method of form description. These results are summarized below. 

Our first result applies to scaling of the output functions due to the scaling of the 
original maps. Such scalings can arise from calibration considerations. In addition, if 
two maps are similar except for their size, then the corresponding column density maps 
should show similar structure except for an overall scaling factor f3. If we apply a scaling 
transformation a — > (ia to a given map, the output functions transform in a relatively 
simple manner, i.e., x(cr; E) — > x(P°"i ^) = x{°"i ^/P)- This transformation thus allows us 
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to compare maps which are known to have different sizes. We have also shown that under 
more general transformations of the form a — > F(cr)cr, the output functions transform 
according to x(cr; E) — > x(cr; £), where E is the root of equation [3.4]. 

Our next result shows how the output functions are affected by a general class of 
transformations of the domain of the maps. This class of affine transformations includes 
rotations, translations, and an overall stretching of the domain. The output functions 
for the distributions of density, volume, and components are all invariant under gen- 
eral affine transformations. The distribution of filaments is invariant under a restricted 
class of transformations known as similitudes, and is not invariant under general affine 
transformations. 

We have found one additional property of this formal system. For any given as- 
trophysical map, the distribution of density output function is always greater than the 
distribution of volume (we prove this result in Appendix H) . This property of the output 
functions is evident for the maps considered in our companion paper (WA). 

We have discussed the propagation of errors in the original maps into the correspond- 
ing errors in the output functions. The distribution of density function is well behaved in 
the sense that small map errors lead to small errors in the output function. The distribu- 
tion of volume function behaves similarly. The distribution of filaments is well behaved 
provided that the number of components can be well determined. On the other hand, 
the distribution of components is potentially more problematic. The function n(cx; E) can 
have considerable errors due to single erroneous pixels sticking up above the threshold 
level. However, we can estimate the probability of a spurious component arising with 
k pixels. This probability is oc Pf (where Pi is expected to be a small number - see 
Appendix F) and thus the probability of obtaining a spurious component with a large 
number k of pixels is quite small. In practice, we expect that by removing from consider- 
ation all components with only one or two pixels, we can adequately control the errors in 
the distribution of components n(a; E). Finally, we have shown that uncertainties in the 
coordinates are small as long as the corresponding uncertainties in the output functions 
are small. 

This present paper (in conjunction with Paper I) provides a well-defined formal 
system for the quantitative description of astrophysical maps and other similar data 
structures. This formal system can be directly applied to many types of existing astro- 
physical maps (see, e.g., WA for an application to continuum maps of molecular clouds). 
In addition, this method of form description can have applications outside the field of 
astrophysics. For example, studies of computer vision and artificial intelligence require 
the construction of explicit and meaningful descriptions of physical objects from images 
(see, e.g., Ballard & Brown 1982). 

In addition to direct applications of this formal system, many possibilities for future 
developmental work remain. So far, we have only used the most basic physical quantities 
(e.g., fraction of high density material by mass and by volume, number of pieces, and 
shape of the pieces) as the basis for our output functions. As the study of astrophysical 
form description becomes better understood, additional output functions should be de- 
veloped to provide a better description of these maps. These additional output functions 
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should be constructed so that they are either invariant under the transformations dis- 
cussed in §3 or they transform in a simple manner. These new output functions should 
also be "well behaved" in terms of their sensitivity to observational uncertainties in the 
maps. 
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APPENDIX A: PROOF OF THEOREM 1 

In this Appendix, we present a proof of Theorem 1, which states that the output 
functions scale under the transformation a — > (3a according to x(<r; E) — > x(a;E//3), 
where x(cr; E) represents any of the following output functions: distribution of density 
m((j; E), volume v(a; E), components n(a; E), or filaments f(a; E). 

Proof: For the distribution of density m(cr; E), we begin with the definition of m, i.e., 

/d"x a(x)e[q(x)-E] 
m(a;E) = p- L —— -. (Al) 

Under the transformation [3.1] we obtain 

ftPx q(x)0[Mx)-E] 

m{(3a; E = 1 J -, A2 

J a n x a(x) 

where we have canceled the constants (3 in the numerator and the denominator. If we 
then invoke the identity for step functions, 

Q[ax-c] = e[x-c/a\, (A3) 

where a and c are constants, we obtain the scaling relation 

m(/3a;E) = m(a;E//3). (A4) 

Similarly, using the definition [2.2] for the distribution of volume and the scaling 
transformation [3.1], we can write 

[cTx 0[Mx) - El , , , 
vtfa; E) = 1 = v(a; E//3), (A5) 
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where the second equality follows from applying the identity [A3] for the step function. 

Since the distributions of components n(a; E) and filaments f(a; E) measure prop- 
erties of the reduced space X£, we must begin with its definition 



X+ = jx G D a(x) > S|. 



(A6) 



Under the transformation a — > (3a, the reduced space becomes 



X+ ->|x G L> #t(x) > e} 
={x£D a(x) > E//3 j . 



We thus have the result 



(A7) 



(A8) 



Since both output functions n(a; E) and f(a; E) depend only on the properties of the 
reduced space, the relation [A8] implies that 



n((3a; E) = n(a; S//3), 



(A9) 
(A10) 



which concludes the proof. 



APPENDIX B: PROOF OF THEOREM 2 

In this Appendix, we present a proof of Theorem 2, which describes how the output 
functions transform when the maps are rescaled as in the nonlinear transformation of 
equation [3.3]. 

Proof: To consider the effect of the transformation on the distribution of density function 
m(cr; E), we use the interpretation that —dmjdY, is the probability that the map has a 
value E. Thus, the probability that the transformed map has a value of some threshold 
E is the probability that 

F(a)(T = E, (Bl) 
which is simply the probability that a = E, which, in turn, is given by 

dm 



dE 



(E). (B2) 



The transformed output function mj can then be written in terms of the untransformed 
output function through 

dm 



mx = 



E 
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If we evaluate the integral and use the fact that m vanishes as E — > oo, then we find 

m T (a;E) = m(a;E). (B4) 

Using a similar argument, we can write the transformed output function vt for the 
distribution of volume in terms of the original untransformed one via 

v T (o-;E) = v(a;T,). (B5) 

Since the distributions of components n(a; E) and filaments f(a; E) measure proper- 
ties of the reduced space X^, we must consider how the reduced space transforms under 
equation [3.3]. This argument thus parallels that of Appendix A. Under the transforma- 
tion of equation [3.3], the reduced space becomes 

I s + ^{xGD F(a)a> e) 

^ J (B6) 
= |x G D <r(x) > E| , 

where we have used the definition of E. We thus have the result 

X+(F(a)a)=X±(a). (B7) 

Since both output functions n(cr; E) and f(a; E) depend only on the properties of the 
reduced space, the relation [B7] implies that 

n(F(a)a;E) = n(a;E), (B8) 

/(F(a)a;E) = /(a;E), (B9) 

which concludes the proof. • 



APPENDIX C: PROOF OF THEOREM 3 

In this Appendix, we present a proof of Theorem 3, which states that the output 
functions m(a; E), v(a; E), and n(a; E) are invariant under general affine transformations, 
whereas the distribution of filaments f(a; E) is invariant under similitudes. 

Proof: We begin by noting that the integral of a function F transforms under an affine 
transformation according to 

J dxdxF(x,y) — > | det A\ J dxdy F(x,y), (CI) 

where A is the matrix of the transformation and where F is constant under the transfor- 
mation (see, e.g., Barnsley 1988). 
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Thus, under an affine transformation, the integrals appearing in the definitions of 
the distributions of density and volume transform according to 



J cTx <x(x) e[a(x) -E] -> |detA| J cTx <x(x) 0[<t(x) - E] (C2) 

y cTxcx(x) -> |detA| ^ cTx a(x), (C3) 
y cTx 6[<r(x) -E] -> |detA| y cTx 0[<t(x) - E] (C4) 

y cTx -> I det A| y ePx . (C5) 

Since the distribution of density m(a; E) is the ratio of the integral appearing in equation 
[C2] to that in equation [C3], m(cr; E) is invariant under the transformation as long as 
| det A\ 7^ 0. Similarly, equations [C4 - C5] imply that the distribution of volume v(a; E) 
is also invariant under affine transformations. 

To study the effects of affine transformations on the distributions of components 
n(a; E) and filaments f(a; E), we must first consider the effects of the transformation on 
the reduced space defined by equation [2.7], i.e., we consider the transformed space 

ic(4) = {ic(x) xeD,(r(x)>Ej. (C6) 

If we write the original reduced space as a union of its components 

n(<r;E) 

X+= |J U 3 , (C7) 
i=i 

where each Uj is a component, then we can write the transformed space as 

n(cr;£) 

w(X+)= |J w(Uj). (C8) 
j=i 

To show that the output function n(cr; E) is invariant under the transformation, we must 
show that the transformed sets w(Uj) are in fact the components of the transformed 
space. In order for this statement to be true, we must have: 

w(Uj) connected Vj, (C9) 

w(Uj) f]w(U k ) =0 Vj ^ k. (CIO) 
When the affine transformation w is considered only on a set Uj, i.e., 

w.Uj ^w(Uj), (Cll) 
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then w is a homeomorphism. Hence, the first required condition [C9] is automatically 
satisfied (see, e.g., Kelley 1955; Conover 1975). We now show that the second condition 
[CIO] is also satisfied: Suppose [CIO] were false. Then, by supposition, an element x 
exists such that 

x G w(Uj} n w(Uk) (C12) 
for some j 7^ k. Then, because w is a homeomorphism and thus has a unique inverse, 

^(xjEae^, (C13) 



and also 



aG U k . 



(C14) 



The intersection Uj H Uk is thus nonempty. This result contradicts the fact that Uj and 
Uk are components. Thus, by contradiction, condition [CIO] must hold. As a result, the 
distribution n(a; E) of components must be invariant under affine transformations. 

We now consider how the distribution of filaments f(a; E) transforms under an 
affine transformation. Given the definitions [2.9 - 2.11], and given that the distribution 
of components n(cr; E) is invariant under the transformation, we must calculate how the 
diameter T> and the area A of an open set Uj transform. The area A is simply the integral 
over the set and the transformation is given by equation [C5] above, i.e., 



A 



w(Uj) = \det A\A(Uj) 



(C15) 



where w is the affine transformation. Under this same transformation, the diameter of a 
set becomes 



V 



w(Uj) =max||x — y| x, y G 



Let d = x — y where x, y G w(Uj). Then 

d = w(x ) - w(y ) = A(x - y ), 



(C16) 



(C17) 



where x , yo G Uj and where A is the matrix of the affine transformation. If the trans- 
formation w is a similitude (see equation [3.6]), then it follows that 



l jl z 21 i z 
d = r x - y , 



(C18) 



where r is the scaling factor of the transformation. Since equation [C18] holds for all 
pairs of points, it follows that 



V 



w(Uj) = r l V\Uj) = I detA\ V z (Uj), 



(C19) 



a result which holds for all similitudes. Taken together, equations [C15] and [C19] imply 
that the factor Tj = t\V 2 -/AA (see equation [2.10]) is invariant under similitudes. This 
result, in conjunction with the invariance of n(a; E), implies that the distribution f(a; E) 
of filaments (see equation [2.11]) is also invariant under similitudes. • 
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We should remark that the distribution of filaments is not invariant under a general 
affine transformation (as in equation [3.5]). This claim is straightforward to verify and 
will not be discussed here. 



APPENDIX D: RESOLUTION TRANSFORMATIONS 

In this Appendix we derive an asymptotic expansion for the resolution transformation 
1Z given by 

K[<r(x)] = M j dV exp[-(x / - x) 2 /^V( x ') , ( D1 ) 
where all of the quantities are the same as those defined in the text. With the definitions 

f = (x' - x)/R, 

V = (y'-y)/R : (D2) 
we can write the resolution transformation in the form 

K[a] = MR 2 J did V expf-^ 2 + r/ 2 )] a& rf) . (D3) 

Integrals of the above form can be written in terms of an asymptotic expansion when 
the parameter A ^> 1 (see Bleistein & Handelsman 1986; see also Adams 1991). After a 
considerable amount of algebra, we obtain the expansion 



*W*.»>1 = E 7^7 A M ( „> + (D4) 



where A2 is the two dimensional Laplacian operator, i.e., 

d 2 d 2 

*> S W + W (D5) 

Notice that we have not written the expansion in equation [D4] as an infinite series, 
but rather as the — 1 th partial sum with an error term 0(X~ N ). An infinite series is 
meaningless in this context because the series need not converge (this property is a well 
known aspect of asymptotic analysis - see, e.g., Bleistein & Handelsman 1986). 
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APPENDIX E: ERROR ANALYSIS 
FOR THE DISTRIBUTION OF DENSITY 



In this Appendix we estimate how uncertainties in an observed astrophysical map 
will affect the output function m(cr; E) for the distribution of density. We also discuss 
how these uncertainties affect the determination of the corresponding coordinate r\ m . 
Using the decomposition [4.1], the function (denoted here as a m ) which appears in the 
numerator in equation [2.1] becomes 

a m = (a R + Aa)Q[a R + Aa-T,], (El) 

and the integrals appearing in the definition of the mass fraction m become 

Jd n xa m = J cTx a R [or + Aa-E] + J cTx Aa [a R + Aa - E] , (E2) 

J cTxa = J d n xa R + J ePxAa. (E3) 

Since the errors Aa are randomly distributed (and hence are equally likely to be posi- 
tive or negative), the integrals of Aa tend toward zero. Notice that the step function 
introduces a different measure of integration, but does not otherwise change this result.''' 

Thus, the distribution of density m hs that is calculated from the observed map a b s 
is related to the true distribution m R through 

m obs (cr; E) = m R (a; E), (E4) 

where we have defined 

E = E - Aa. (E5) 

In other words, the net effect of the uncertainty in the observed map is to introduce an 
uncertainty in the variable E of the output function. 

Using equations [E4] and [E5], we can calculate the associated error in the output 
function m(E) for a give map through a Taylor series expansion: 

m i? (E) = mobs(E) = m ob s(E) + — ^Aa + O [(Aa) 2 ] . (E6) 

To leading order the error in the output function is then approximately given by 

dm 



error = Am 



rfE 



Aa, (E7) 



t This statement is only correct to first order in Aa. Since the Aa integral in equation 
[E2] contains a step function which includes the error contribution Aa, noise in the map 
will push more pixels above the threshold E than below. However, this effect is second 
order in Aa. To show this, we expand the step function in the integral in equation [E2] 
to obtain 6 [a R + Aa — E] = [a R — E] — Aa 8 [a R — E] . Using this expansion in the 
original integral, we see that a Aa 2 term appears. This term is positive definite and 
hence does not integrate to zero; however, the term is second order and can be ignored. 
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which is first order in Act and therefore small if the signal to noise ratio in the original 
map is sufficiently high. 

Given the expansion [E6] for the error in the output function, we now calculate 
the corresponding uncertainty in the coordinate rj m . Using the expansion [E6] in the 
definition of the coordinate (see Paper I), we can show that 



mR 



V 2 moh s ) = -J o 2(l-m obs ) — AadZ + J 2m obs — AadE. (E8) 



Since the uncertainty Aa can be either positive or negative, the integrals in equation [E8] 
tend to cancel out. If, however, we consider the uncertainty to be bounded |Acr| < e, we 
can place corresponding bounds on the integrals: 



r c dm 
J 2(1 - m ohs )— AadZ 



< e/4, 



dm 

2m ohs — Aa dE 



< e/4, 



where we have used the fact that m(0) = 1, m(ac) = 1/2, and m(oo) 
all of the above results shows that 



(E9a) 

(E9b) 
0. Combining 



Ai] m < 



+ 0(e 2 ) + 0(A V 2 ). 



(E10) 



APPENDIX F: ERROR ANALYSIS 
FOR THE DISTRIBUTION OF COMPONENTS 

In this Appendix we discuss how observational errors in the map a can produce errors 
in the output function n(a; E) which determines the number of topological components 
as a function of threshold density E. 

As discussed in the text (§4), we must estimate the probability that a given pixel in 
a map will have an observed value E obs which is greater than a given threshold value Eq 
when in fact the true value Ex of the pixel is less than that of the threshold. We first 
assume that the probability of error in the observed value has a Gaussian distribution so 
that the probability of having an observed value E Q bs and a true value Et is given by 

p(E obs ,E T ) =A/"exp[-(E obs -E T ) 2 /a 2 ], (Fl) 

where M is the normalization factor and where a is the standard deviation. For a given 
threshold Eq, the probability Pi (Eg) that an observed value is greater than the threshold 
and that the true value is less than the threshold is given by 

P 1 (E )=Ar/ dZ ohs -— / dE T exp[-(E obs -E T ) 2 /a 2 ], (F2) 
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where we have integrated over all possible values E obs > E and all possible values 
Et < E and we have used the probability —dm/dT that the observed value is E obs . 

Although the probability Pi depends on the threshold E and on the distribution 
of density m(a; E) for the map, we can derive a useful upper bound. We first introduce 
non-dimensional quantities 

X = (E obs - T T )/a, 



a = (E obs - T )/d > 0, 
b = T ohs /a > a. 

The integral over Ej- appearing in equation [F2] can then be written in the form 



(F3) 



J a 



e. ~ x ~ dx < a^e~ a2 



(F4) 



where the final inequality is straightforward to show. Using the bound of equation [F4] 
in the expression [F2] for P±, we obtain the bound 



Pi(So) < 



2 7e 



dE 



obs 



dm 



obs 



exp[-(E obs - E ) 2 /o- 2 ] : 



(F5) 



where we have used the normalization condition for the Gaussian probability distribution 
(see equation [Fl]), i.e., 



2Aft 



/ e - y2 dv = Md^ = 1 
Jo 



(F6) 



A weaker bound on the probability P± can be found by replacing —dm/dT! by its maximum 
value and evaluating the remaining integral to obtain 



dm~\ 



dE 



(F7) 



In practice, we can use the observed distribution of density profile m(cr; E) to find 
the maximum value of —dm/dT, and thus constrain P\. However, we can obtain a crude 
numerical estimate for Pi as follows. We write 



dm 
rfE 



AE ' 



(F8) 



where AE is the overall dynamic range of the map. The ratio in equation [F8] is thus 
the inverse of the average signal to noise ratio of the map. We thus obtain the estimate 



Tat 



10" 3 ,10" 



(F9) 
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where we haved used signal to noise ratios of ~ 400 and ~ 40 to obtain the numerical 
estimates. These values are representative of the IRAS maps considered in our companion 
paper (WA). 

Thus, for a map with a large number of pixels, many pixels can be "wrong" at a 
given threshold in the sense that the pixel value appears to be larger than the threshold 
value when in fact the true value of less than the threshold. We note that this estimate 
represents an upper bound on the probability Pi and we also note that only a small 
fraction of these "erroneous" pixels will produce spurious topological components. As we 
discuss in the text, the number of spurious components (those components produced by 
pixels erroneously appearing larger than a threshold) can be controlled if we restrict our 
attention to components with three or more pixels (see §4). 



APPENDIX G: ERROR ANALYSIS 
FOR THE DISTRIBUTION OF FILAMENTS 

In this Appendix, we consider the effects of observational errors on the output func- 
tion f(a; E) which measures the filamentary nature of the map. As defined in §2, the 
filament function is a sum over all of the components of the map. For a given threshold 
level E, we separate the true components n R from the spurious ones An through 

n Q bs = n R + An, (Gl) 

where it is understood that all three quantities are functions of the threshold level E. 
The filament function can then be written in the form 

n R An 

Us = t + A ^ + i a + A ^)' (G2) 

n R + An z —' n R + An f—' 

j=0 k=0 

where AT 3 - is the uncertainty in the filament index of a given component. If we let f R 
denote the true value of the distribution of filaments, the uncertainty can be written in 
the form 

At? r 1 An i 1 ™R+ An 

A/ = /w bs= , a /it- r E^ — n- E ( G3 ) 

n R + An 1 An f—' ) n R + An 

fc=0 3=0 

where the first term in brackets is due to the uncertainty in the number of components, 
the second term in brackets is the contribution of the spurious components, and the 
final sum is the error due to the uncertainty in each individual filament index. Since 
the uncertainties ATj can be either positive or negative, this final sum will tend toward 
zero. Let us therefore concentrate on the terms in brackets. The sum in brackets is the 
filament index of the spurious components and we will denote this quantity as fs- The 
relative error in the distribution of filaments can then be written 



^ = _^L_{l-f}„^{l-M, (G4) 
f R n R + An I f R J n R I f R i 
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where we have assumed that An <C hr in obtaining the second approximate equality. As 
we argue in the text (see §4), the term in brackets is expected to be of order unity. Thus, 
the relative error in the filament index is small if the relative error in the distribution of 
components can also be made small. 



APPENDIX H: RELATIONSHIP BETWEEN THE 
DISTRIBUTIONS OF DENSITY AND VOLUME 

In this appendix, we present a simple relationship between the distribution of density 
m(cr; E) and the distribution of volume v(a; E). For any map a and for any threshold 
level E, the distribution of density m(a; E) is always greater than the corresponding 
distribution of volume v(a; E). We show that this claim is true by the following argument: 

We begin by defining an average density (a), 



(a) = J d n x a(x) where A n = J cTx , 



(HI) 



and where the integrals are taken over the entire map. The subscript n denotes the 
number of spatial dimensions (this argument holds for arbitrary n). The distribution of 
density can now be written in the form 



1 

An 



m(cr; E) 

The distribution of volume can be written in the form 

v(a; E) = — ^— / cPx Q[a - E] 
A n J 



(H2) 



(H3) 



For high threshold levels, i.e., E > (a), the quantity a(x)/(a) is greater than unity 
whenever the integrand of either equation [H2] or [H3] is nonzero. It follows directly that 



m(cr; E) > v(a; E) for E > (a). 



(H4) 



We must now consider the case when the threshold level E < (a). The derivatives of the 
two distributions (with respect to the threshold E) are given by 



dm 



dE 



dv 
dS 



(H5) 



(H6) 



For the threshold range of interest, E < (cr), the quantity a(x)/(cr) is less than unity 
whenever the integrand of either equation [H5] or [H6] is nonzero (i.e., whenever a = E). 
As a result, 



dm 



< 



dv 
dE 



(H7) 
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for the threshold range of interest. Since both functions start at unity, m(<r; 0) = 1 = 
v(a;0), and both function have negative slopes, equation [H7] implies that 

m(cr; E) > v(a; E) for E < (a). (H8) 

Thus, by equations [H4] and [H8], the distribution of density m(a; E) is always greater 
than the distribution of volume v(a; E). 
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